Heterogeneous Attentions for Solving Pickup and Delivery Problem via Deep Reinforcement Learning
Abstract
Recently, there is an emerging trend to apply deep reinforcement learning to solve the vehicle routing problem (VRP), where a learnt policy governs the selection of the next node for visiting. However, existing methods could not handle well the pairing and precedence relationships in the pickup and delivery problem (PDP), which is a representative variant of VRP. To address this challenging issue, we leverage a novel neural network integrated with a heterogeneous attention mechanism to empower the policy in deep reinforcement learning to automatically select the nodes. In particular, the heterogeneous attention mechanism specifically prescribes attentions for each role of the nodes while taking into account the precedence constraint, i.e., the pickup node must precede the pairing delivery node. Further integrated with a masking scheme, the learnt policy is expected to find higher-quality solutions for solving PDP. Extensive experimental results show that our method outperforms the state-of-the-art heuristic and deep learning model, respectively, and generalizes well to different distributions and problem sizes.
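The precedence constraint and masking scheme mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the node layout (node 0 is the depot, nodes 1..n are pickups, node i + n is the delivery paired with pickup i) and the function names `feasible_mask` and `masked_softmax` are assumptions made for illustration only.

```python
import numpy as np


def feasible_mask(n_pairs, visited):
    """Return a boolean array over all nodes; True means "may be visited next".

    Assumed (hypothetical) layout: node 0 is the depot, nodes 1..n_pairs are
    pickups, and node i + n_pairs is the delivery paired with pickup i.
    """
    n = n_pairs
    visited = np.asarray(visited, dtype=bool)
    # Start from "every unvisited node is allowed".
    mask = ~visited
    # Precedence: a delivery is selectable only after its paired pickup.
    mask[n + 1:] &= visited[1:n + 1]
    # The depot is selectable only once every other node has been visited.
    mask[0] = visited[1:].all()
    return mask


def masked_softmax(scores, mask):
    """Turn raw node scores into probabilities, giving infeasible nodes 0."""
    scores = np.where(mask, np.asarray(scores, dtype=float), -np.inf)
    scores = scores - scores[mask].max()  # subtract max for numerical stability
    exp = np.exp(scores)
    return exp / exp.sum()


# Two pickup-delivery pairs; the vehicle is at the depot and has visited pickup 1.
visited = [True, True, False, False, False]
mask = feasible_mask(2, visited)
probs = masked_softmax(np.zeros(5), mask)
```

In this example only pickup 2 and the delivery paired with pickup 1 remain selectable, so a policy sampling from `probs` cannot produce a route that violates the precedence constraint.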
Similar resources
Solving a Practical Pickup and Delivery Problem
We consider a pickup and delivery vehicle routing problem commonly encountered in real-world logistics operations. The problem involves a set of practical complications that have received little attention in the vehicle routing literature. In this problem, there are multiple carriers and multiple vehicle types available to cover a set of pickup and delivery orders, each of which has multiple pi...
Deep Reinforcement Learning for Solving the Vehicle Routing Problem
We present an end-to-end framework for solving Vehicle Routing Problem (VRP) using deep reinforcement learning. In this approach, we train a single model that finds near-optimal solutions for problem instances sampled from a given distribution, only by observing the reward signals and following feasibility rules. Our model represents a parameterized stochastic policy, and by applying a policy g...
Problem solving with reinforcement learning
This thesis is concerned with practical issues surrounding the application of reinforcement learning techniques to tasks that take place in high dimensional continuous state-space environments. In particular, the extension of on-line updating methods is considered, where the term implies systems that learn as each experience arrives, rather than storing the experiences for use in a separate oo-...
Modelling and Solving the Capacitated Location-Routing Problem with Simultaneous Pickup and Delivery Demands
In this work, the capacitated location-routing problem with simultaneous pickup and delivery (CLRP-SPD) is considered. This problem is a more realistic case of the capacitated location-routing problem (CLRP) and belongs to the reverse logistics of the supply chain. The problem has many real-life applications of which some have been addressed in the literature such as management of liquid petrol...
Solving the Vehicle Routing Problem with Simultaneous Pickup and Delivery by an Effective Ant Colony Optimization
One of the most important extensions of the capacitated vehicle routing problem (CVRP) is the vehicle routing problem with simultaneous pickup and delivery (VRPSPD) where customers require simultaneous delivery and pick-up service. In this paper, we propose an effective ant colony optimization (EACO) which includes insert, swap and 2-Opt moves for solving VRPSPD that is different with common an...
Journal
Journal title: IEEE Transactions on Intelligent Transportation Systems
Year: 2022
ISSN: 1558-0016, 1524-9050
DOI: https://doi.org/10.1109/tits.2021.3056120